Combining sequence and structure information in protein alignments
نویسندگان
چکیده
For distantly related proteins, alignmentsbased on structural information are more reliable than traditional sequence alignments. However, when structural comparison leaves some ambiguity in alignment, sequence information can provide valuable additional information to discriminate between multiple alternatives. In this paper we present a Bayesianmodel that incorporates sequence information into structural alignments in an automatic and adaptive fashion. By use of an estimated measure of conservation between sequence and structure, we construct an ensemble sequence/structure alignment tool capable for building refined protein alignments and identifying conserveed functional regions. This model also provides a natural tool for using structural alignment to determine evolutionary distance, which may allow phylogenetic analysis of proteins at significantly larger divergence times. Two examples are presented and compared with previous analysis in the
منابع مشابه
3DCoffee: combining protein sequences and structures within multiple sequence alignments.
Most bioinformatics analyses require the assembly of a multiple sequence alignment. It has long been suspected that structural information can help to improve the quality of these alignments, yet the effect of combining sequences and structures has not been evaluated systematically. We developed 3DCoffee, a novel method for combining protein sequences and structures in order to generate high-qu...
متن کاملCombining the GOR V algorithm with evolutionary information for protein secondary structure prediction from amino acid sequence.
We have modified and improved the GOR algorithm for the protein secondary structure prediction by using the evolutionary information provided by multiple sequence alignments, adding triplet statistics, and optimizing various parameters. We have expanded the database used to include the 513 non-redundant domains collected recently by Cuff and Barton (Proteins 1999;34:508-519; Proteins 2000;40:50...
متن کاملCombining evolutionary information and neural networks to predict protein secondary structure.
Using evolutionary information contained in multiple sequence alignments as input to neural networks, secondary structure can be predicted at significantly increased accuracy. Here, we extend our previous three-level system of neural networks by using additional input information derived from multiple alignments. Using a position-specific conservation weight as part of the input increases perfo...
متن کاملCOMBOSA3D: combining sequence alignments with three-dimensional structures
UNLABELLED COMBOSA3D is a program that allows sequence conservation to be viewed in its proper three-dimensional environment. It superimposes sequence alignment information onto a protein structure using a customizable color scheme, which is also applied to a textual sequence alignment for reference. AVAILABILITY The program can be tested at http://www.bioinformatics.org/combosa3d/, and the s...
متن کاملIterative sequence/secondary structure search for protein homologs: comparison with amino acid sequence alignments and application to fold recognition in genome databases
MOTIVATION Sequence alignment techniques have been developed into extremely powerful tools for identifying the folding families and function of proteins in newly sequenced genomes. For a sufficiently low sequence identity it is necessary to incorporate additional structural information to positively detect homologous proteins. We have carried out an extensive analysis of the effectiveness of in...
متن کامل